Cross-Lingual Automatic Speech Recognition Using Tandem Features
نویسندگان
چکیده
منابع مشابه
Transfer learning for cross-lingual automatic speech recognition
In this study, an instance based transfer learning phoneme modeling approach is presented to mitigate the effects of limited data in a target language using data from richly resourced source languages. A maximum likelihood (ML) learning criterion is introduced to learn the model parameters of a given phoneme class using data from both the target and source languages. Each phoneme was modeled us...
متن کاملSpeech emotion recognition with cross-lingual databases
In this paper, we investigate cross-lingual automatic speech emotion recognition. The basic idea is that since the emotion recognition system is based on the acoustic features only, it is possible to combine data in different languages to improve the recognition accuracy. We begin with the construction of a Mandarin database of emotional speech, which is similar to the well-known Berlin Databas...
متن کاملCross-lingual Interpolation of Speech Recognition Models
A method is proposed for implementing the cross-lingual porting of recognition models for rapid prototyping of speech recognisers in new target languages, specifically when the collection of large speech corpora for training would be economically questionable. The paper describes a way to build up a multilingual model which includes the phonetic structure of all the constituent languages, and w...
متن کاملA Cross Gender and Cross Lingual Study on Acoustic Features for Stress Recognition in Speech
We present a systematic study of the acoustic features for emotional stress in university students across gender and language groups. We design a common questionnaire of stress-inducing and non-stressinducing questions in Chinese and English, and interviewed 25 native speakers of Mandarin and 31 native speakers of English, of both gender. We extract 560 acoustic features including as low-level ...
متن کاملVariability of automatic speech recognition systems using different features
The paper describes the use of two recognizers fed by different acoustic features. The first recognizer performs Multiple Resolution Analysis (MRA) while the other recognizer computes JRASTA Perceptual Linear Prediction Coefficients (JRASTAPLP). The two recognizers use the same denoising method but perform different partitions of their acoustic spaces. Experiments with the Italian and Spanish c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Audio, Speech, and Language Processing
سال: 2013
ISSN: 1558-7916,1558-7924
DOI: 10.1109/tasl.2013.2277932